智能论文笔记

Enabling Massage Actions: An Interactive Parallel Robot with Compliant Joints

Huixu Dong , Yue Feng , Chen Qiu , Ye Pan , Miaoying He , I-Ming Chen

分类：机器人

2022-08-26

我们提出了一个基于串联弹性执行器（SEA）的平行按摩机器人，提供统一的力量控制方法。首先，建立了运动和静态力模型，以获得相应的控制变量。然后，提出了一种新型的力位控制策略，以在不需要机器人动力学模型的情况下分别控制沿表面正常方向的力位和另一个两方向位移。为了评估其性能，我们实施了一系列机器人按摩实验。结果表明，所提出的按摩操纵器可以成功实现按摩任务的所需力和运动模式，从而达到高得分用户体验。

translated by 谷歌翻译

GSG: A Granary Soft Gripper with Mechanical Force Sensing via 3-Dimensional Snap-Through Structure

Huixu Dong , Chao-Yu Chen , Chen Qiu , Chen-Hua Yeow , Haoyong Yu

分类：机器人

2021-11-07

Grasping是实际应用中大多数机器人的重要能力。软机器人夹具被认为是机器人抓握的关键部分，并在对象几何形状方差方差的高度和稳健性方面引起了相当大的关注;然而，它们仍然受到相应的传感能力和致动机制的限制。我们提出了一种新型软夹具，看起来像碎碎的碎碎片，其具有综合模具技术制造的柔顺的双稳态机构，纯粹机械地实现感测和致动。特别地，所提出的夹持器中的卡通双稳态结构允许我们降低机构的复杂性，控制，感测设计，因为抓握和感测行为是完全被动的。一旦夹持器的触发位置触及物体并施加足够的力，抓握行为就会自动激励。为了用各种型材抓住物体，所提出的粮食软夹具（GSG）设计为能够包封，夹紧和持续爪。夹具由腔掌，棕榈帽和三个手指组成。首先，分析夹具的设计。然后，在构造理论模型之后，进行有限元（FE）仿真以验证构建的模型。最后，进行了一系列掌握实验，以评估所提出的夹持器对抓握和感测的卡通行为。实验结果说明了所提出的夹持器可以操纵各种柔软和刚性物体，并且即使它承担外部干扰，也可以保持稳定。

translated by 谷歌翻译

RELIANT: Fair Knowledge Distillation for Graph Neural Networks

Yushun Dong , Binchi Zhang , Yiling Yuan , Na Zou , Qi Wang , Jundong Li

分类：机器学习

2023-01-03

Graph Neural Networks (GNNs) have shown satisfying performance on various graph learning tasks. To achieve better fitting capability, most GNNs are with a large number of parameters, which makes these GNNs computationally expensive. Therefore, it is difficult to deploy them onto edge devices with scarce computational resources, e.g., mobile phones and wearable smart devices. Knowledge Distillation (KD) is a common solution to compress GNNs, where a light-weighted model (i.e., the student model) is encouraged to mimic the behavior of a computationally expensive GNN (i.e., the teacher GNN model). Nevertheless, most existing GNN-based KD methods lack fairness consideration. As a consequence, the student model usually inherits and even exaggerates the bias from the teacher GNN. To handle such a problem, we take initial steps towards fair knowledge distillation for GNNs. Specifically, we first formulate a novel problem of fair knowledge distillation for GNN-based teacher-student frameworks. Then we propose a principled framework named RELIANT to mitigate the bias exhibited by the student model. Notably, the design of RELIANT is decoupled from any specific teacher and student model structures, and thus can be easily adapted to various GNN-based KD frameworks. We perform extensive experiments on multiple real-world datasets, which corroborates that RELIANT achieves less biased GNN knowledge distillation while maintaining high prediction utility.

translated by 谷歌翻译

Vocabulary-informed Zero-shot and Open-set Learning

Yanwei Fu , Xiaomei Wang , Hanze Dong , Yu-Gang Jiang , Meng Wang , Xiangyang Xue , Leonid Sigal

分类：计算机视觉 | 机器学习

2023-01-03

Despite significant progress in object categorization, in recent years, a number of important challenges remain; mainly, the ability to learn from limited labeled data and to recognize object classes within large, potentially open, set of labels. Zero-shot learning is one way of addressing these challenges, but it has only been shown to work with limited sized class vocabularies and typically requires separation between supervised and unsupervised classes, allowing former to inform the latter but not vice versa. We propose the notion of vocabulary-informed learning to alleviate the above mentioned challenges and address problems of supervised, zero-shot, generalized zero-shot and open set recognition using a unified framework. Specifically, we propose a weighted maximum margin framework for semantic manifold-based recognition that incorporates distance constraints from (both supervised and unsupervised) vocabulary atoms. Distance constraints ensure that labeled samples are projected closer to their correct prototypes, in the embedding space, than to others. We illustrate that resulting model shows improvements in supervised, zero-shot, generalized zero-shot, and large open set recognition, with up to 310K class vocabulary on Animal with Attributes and ImageNet datasets.

translated by 谷歌翻译

DensePose From WiFi

Jiaqi Geng , Dong Huang , Fernando De la Torre

分类：计算机视觉

2022-12-31

Advances in computer vision and machine learning techniques have led to significant development in 2D and 3D human pose estimation from RGB cameras, LiDAR, and radars. However, human pose estimation from images is adversely affected by occlusion and lighting, which are common in many scenarios of interest. Radar and LiDAR technologies, on the other hand, need specialized hardware that is expensive and power-intensive. Furthermore, placing these sensors in non-public areas raises significant privacy concerns. To address these limitations, recent research has explored the use of WiFi antennas (1D sensors) for body segmentation and key-point body detection. This paper further expands on the use of the WiFi signal in combination with deep learning architectures, commonly used in computer vision, to estimate dense human pose correspondence. We developed a deep neural network that maps the phase and amplitude of WiFi signals to UV coordinates within 24 human regions. The results of the study reveal that our model can estimate the dense pose of multiple subjects, with comparable performance to image-based approaches, by utilizing WiFi signals as the only input. This paves the way for low-cost, broadly accessible, and privacy-preserving algorithms for human sensing.

translated by 谷歌翻译

A Survey for In-context Learning

Qingxiu Dong , Lei Li , Damai Dai , Ce Zheng , Zhiyong Wu , Baobao Chang , Xu Sun , Jingjing Xu , Lei Li , Zhifang Sui

分类：自然语言处理 | 人工智能

2022-12-31

With the increasing ability of large language models (LLMs), in-context learning (ICL) has become a new paradigm for natural language processing (NLP), where LLMs make predictions only based on contexts augmented with a few training examples. It has been a new trend exploring ICL to evaluate and extrapolate the ability of LLMs. In this paper, we aim to survey and summarize the progress, challenges, and future work in ICL. We first present a formal definition of ICL and clarify its correlation to related studies. Then, we organize and discuss advanced techniques of ICL, including training strategies, prompting strategies, and so on. Finally, we present the challenges of ICL and provide potential directions for further research. We hope our work can encourage more research on uncovering how ICL works and improving ICL in future work.

translated by 谷歌翻译

Transformer in Transformer as Backbone for Deep Reinforcement Learning

Hangyu Mao , Rui Zhao , Hao Chen , Jianye Hao , Yiqun Chen , Dong Li , Junge Zhang , Zhen Xiao

分类：机器学习 | 人工智能 | 机器人

2022-12-30

Designing better deep networks and better reinforcement learning (RL) algorithms are both important for deep RL. This work focuses on the former. Previous methods build the network with several modules like CNN, LSTM and Attention. Recent methods combine the Transformer with these modules for better performance. However, it requires tedious optimization skills to train a network composed of mixed modules, making these methods inconvenient to be used in practice. In this paper, we propose to design \emph{pure Transformer-based networks} for deep RL, aiming at providing off-the-shelf backbones for both the online and offline settings. Specifically, the Transformer in Transformer (TIT) backbone is proposed, which cascades two Transformers in a very natural way: the inner one is used to process a single observation, while the outer one is responsible for processing the observation history; combining both is expected to extract spatial-temporal representations for good decision-making. Experiments show that TIT can achieve satisfactory performance in different settings, consistently.

translated by 谷歌翻译

Deep Temporal Contrastive Clustering

Ying Zhong , Dong Huang , Chang-Dong Wang

分类：机器学习

2022-12-29

Recently the deep learning has shown its advantage in representation learning and clustering for time series data. Despite the considerable progress, the existing deep time series clustering approaches mostly seek to train the deep neural network by some instance reconstruction based or cluster distribution based objective, which, however, lack the ability to exploit the sample-wise (or augmentation-wise) contrastive information or even the higher-level (e.g., cluster-level) contrastiveness for learning discriminative and clustering-friendly representations. In light of this, this paper presents a deep temporal contrastive clustering (DTCC) approach, which for the first time, to our knowledge, incorporates the contrastive learning paradigm into the deep time series clustering research. Specifically, with two parallel views generated from the original time series and their augmentations, we utilize two identical auto-encoders to learn the corresponding representations, and in the meantime perform the cluster distribution learning by incorporating a k-means objective. Further, two levels of contrastive learning are simultaneously enforced to capture the instance-level and cluster-level contrastive information, respectively. With the reconstruction loss of the auto-encoder, the cluster distribution loss, and the two levels of contrastive losses jointly optimized, the network architecture is trained in a self-supervised manner and the clustering result can thereby be obtained. Experiments on a variety of time series datasets demonstrate the superiority of our DTCC approach over the state-of-the-art.

translated by 谷歌翻译

On Deep Recurrent Reinforcement Learning for Active Visual Tracking of Space Noncooperative Objects

Dong Zhou , Guanghui Sun , Zhao Zhang , Ligang Wu

分类：机器人

2022-12-29

Active tracking of space noncooperative object that merely relies on vision camera is greatly significant for autonomous rendezvous and debris removal. Considering its Partial Observable Markov Decision Process (POMDP) property, this paper proposes a novel tracker based on deep recurrent reinforcement learning, named as RAMAVT which drives the chasing spacecraft to follow arbitrary space noncooperative object with high-frequency and near-optimal velocity control commands. To further improve the active tracking performance, we introduce Multi-Head Attention (MHA) module and Squeeze-and-Excitation (SE) layer into RAMAVT, which remarkably improve the representative ability of neural network with almost no extra computational cost. Extensive experiments and ablation study implemented on SNCOAT benchmark show the effectiveness and robustness of our method compared with other state-of-the-art algorithm. The source codes are available on https://github.com/Dongzhou-1996/RAMAVT.

translated by 谷歌翻译

A Deep Learning Method for Real-time Bias Correction of Wind Field Forecasts in the Western North Pacific

Wei Zhang , Yueyue Jiang , Junyu Dong , Xiaojiang Song , Renbo Pang , Boyu Guoan , Hui Yu

分类：人工智能 | 计算机视觉

2022-12-29

Forecasts by the European Centre for Medium-Range Weather Forecasts (ECMWF; EC for short) can provide a basis for the establishment of maritime-disaster warning systems, but they contain some systematic biases.The fifth-generation EC atmospheric reanalysis (ERA5) data have high accuracy, but are delayed by about 5 days. To overcome this issue, a spatiotemporal deep-learning method could be used for nonlinear mapping between EC and ERA5 data, which would improve the quality of EC wind forecast data in real time. In this study, we developed the Multi-Task-Double Encoder Trajectory Gated Recurrent Unit (MT-DETrajGRU) model, which uses an improved double-encoder forecaster architecture to model the spatiotemporal sequence of the U and V components of the wind field; we designed a multi-task learning loss function to correct wind speed and wind direction simultaneously using only one model. The study area was the western North Pacific (WNP), and real-time rolling bias corrections were made for 10-day wind-field forecasts released by the EC between December 2020 and November 2021, divided into four seasons. Compared with the original EC forecasts, after correction using the MT-DETrajGRU model the wind speed and wind direction biases in the four seasons were reduced by 8-11% and 9-14%, respectively. In addition, the proposed method modelled the data uniformly under different weather conditions. The correction performance under normal and typhoon conditions was comparable, indicating that the data-driven mode constructed here is robust and generalizable.

translated by 谷歌翻译